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SENSITIVITY  ANALYSIS  IN  NONLINEAR  PROGRAMMING 
USING  FACTORABLE  SYMBOLIC  INPUT 

by 

Anura  de  Silva 
Garth  P.  McCormick 

1.  Introduction 

This  paper  develops  a methodology  to  utilize  the  input  to  nonlinear 
programs  in  s3mibolic  factorable  form  to  perform  first  order  sensitivity 
analysis  on  the  solution  vector  x(y)  e e’^  of  a general  nonlinear  program, 

where  ycE  is  the  parametric  vector.  In  Section  2 we  discuss  the  key 
results  of  sensitivity  analysis  theory  in  nonlinear  programming.  In  Sec- 
tion 3 we  develop  the  formulae  underlying  the  new  methodology  to  utilize 
the  symbolic  factorable  input  to  perform  sensitivity  analysis  on  the 
solution  vector  x(y)  . 

The  above  are  implemented  via  the  SUMT  algorithm  [4]  for  nonlinear 
programming  which  uses  a penalty  function  technique.  Section  4 briefly 
discusses  the  penalty  function-based  implementation  of  the  sensitivity 
analysis  methodology.  An  extrapolation  result  based  on  the  penalty 
function  parameter  r is  stated  and  incorporated  in  the  computer  code. 
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2.  Relevant  Results  from  NLP  Sensitivity  Theory 
Consitler  the  problem 


MIN 

xeE*^ 


where 


s.  t. 

G : 
H : 


F(x,y) 

G(x,y)  > 0 
h(x,y)  = 0 


_n  _k 
E X E ^ E 

E X E ->-  E*^ 


Problem  M(y) 


By  sensitivity  information  we  mean  the  rates  of  change  of  the 
(locally)  optimal  objective  function  value  and  the  corresponding  optimiz- 
ing vector  with  respect  to  changes  in  the  parametric  vector.  If  x(y) 
is  an  n-vector  and  y a k-vector,  the  first  order  sensitivity  Information 
for  Problem  M(y)  is: 

(a)  The  rate  of  change  of  the  solution  value  with  respect 

,k 


(b) 


to  the  parametric  vector,  DyF[x(y),y]  eE 

The  rate  of  change  of  the  optimizing  vector  with  respect 

dx(y) 


to  the  parametric  vector. 


8xj^(y) 


dy 


where 


dx(y) 

dy 


9Xn(y) 


3x^(y) 


axn(y) 


3yL 


(1) 


The  first  order  sensitivity  results  for  the  general  nonlinear  pro- 
gramming problem  were  first  s :ated  by  Fiacco  and  McCormick  in  [3],  Theorem  6, 
wliere  they  give  conditions  un  ler  which  a smooth  (once  continuously  differ- 
entiable, abbreviated  OCO)  tr  ijectcry  of  the-  optimizing  Kuhn-Tucker  triple 
[x(y)  ,ii(y)  ,w(y)  I exist  for  ■/  close  to  an  initial  value  . The  u 

and  w are  the  Lagrange  mult  iplier  vectors  associated  with  the  inequality 
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constraints  G and  equality  constraints  H of  Problem  M(y) , in  the 
Lagranglan, 

m p 

L(x,y,u,w)  = F(x,y)  - Z u.  G.(x,y)  + Z w.  H.(x,y)  (2) 

i=l  ^ j = l ^ J 

where  u^,G^; i = 1 , . . . ,m  and  = l,...,p  are  the  components  of  the 

mappings  G and  H and  the  Lagrange  multipliers  u and  w . Here,  u(y)  and 
w(y)  are  the  values  of  these  multipliers  associated  with  the  optimal  value 
x(y).  They  assume  the  following  to  hold;  (a)  twice  continuous  differentiability 
(abbreviated  TCD)  of  L(x,y,u,w)  in  (x,y)  in  a neighborhood  of 
(x(yQ),yQ)  , (b)  second  order  sufficiency  conditions  (abbrev.  SOSC)  at 
(x(yQ),yQ)  , (c)  linear  independence  of  binding  constraints  (abbrev.  LIB)  at 
(x(yQ),yQ)  , (d)  strict  complementary  slackness  (abbrev.  SCS) , Here,  (c)  and 
(d)  are  regularity  conditions  they  invoke  to  obtain  the  key  result  in  a useful 
form.  In  a more  recent  paper  Fiacco  [2]  obtains  a similar  result  for  some- 
what more  general  parametric  problem,  such  as  M(y)  . 

The  crux  of  the  theory  is  the  application  of  the  implicit  function 
theorem  (see  [6]  for  a general  treatment  of  the  implicit  function  theorem) 
to  the  first  order  Kuhn-Tucker  system  of  M(y)  , which  is 

L(x,y,u,w)  = 0 

Ui  G^(x,y)  = 0 , i = l,...,m  (3) 

Hj (x,y)  = 0 , j = 1, . . . ,p  . 

Consider  a system  of  equations 

n Ic 

D(x,y)  = 0 , xeE  and  yeE  Problem  P(y)  , 
n k K* 

where  D : E x E ->•  E , i being  the  number  of  scalar  valued  functions 
in  D , which  are  assumed  OCD.  This  ensures  the  existence  of  the  Jacobians 
of  D(x,y)  with  respect  to  x and  y , denoted  J^D  and  J^D  , respec- 
tively. With  the  sensitivity  analysis  context  in  view,  the  Implicit  function 
theorem  could  be  stated  as  follows: 
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n Ic  JZ. 

Theorem  1.  Suppose  D : E x E E ; (x(yQ),yQ)  is  a zero  of  D(x,y)  ; 
D(x,y)  is  OCD  near  (x(yQ),yQ)  ; J^D  is  nonsingular  at  (x(yQ),yQ)  . 

• Tl  Ic 

Then,  there  exist  open  sets  S (x(y  ))CE  and  S (y  )CE  such 

XU  y U 

that  for  any  y e there  exists  a unique  solution  for  x in  terms 

of  y , i,e.,  X = x(y)  £ continuous  in  y • Moreover 

x(y)  is  OCD  at  y^  and 


'y  ■ - [■’x-’o]'*  ty”o] 


where  the  subscript  0 on  D denotes  evaluation  at  (x(yQ) .y^)  . 


If  P(y)  is  compared  with  (3)  the  role  of  Theorem  1 in  the  con- 
text of  sensitivity  analysis  of  the  solution  vector  becomes  evident.  The 
assumed  TCD  of  the  problem  functions  gives  the  necessary  OCD  for  the  sys- 
tem in  (3) . The  additional  conditions  of  linear  independence  of  the 
binding  constraint  gradients  and  SCS  were  assumed  by  Fiacco  and  McCormick 
[3]  and  Fiacco  [2]  to  produce  desirable  results  such  as  the  uniqueness  of 
the  Lagrange  multipliers  [u(y),w(y)]  associated  with  the  optimal  x(y) 
and  the  invariance  of  the  set  of  inequality  constraints  binding  at  ^(Yq) 

in  a neighborhood  of  y^  . The  following  representation  for  the  Kuhn- 
Tucker  triple, 

X A (x,u,w) 

and  X(y)  A [x(y) ,u(y) ,w(y) ] 

will  be  adopted  for  notational  convenience.  For  the  same  reason,  the  first 
order  Kuhn-Tucker  system  of  (3)  will  be  denoted  as  follows. 

R(x,u,w,y)  = 0 

or  (3b) 

R(;:,y)  = 0 . 
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Differentiating  (3b)  partially  with  respect  to  X yields  the 
Jacobian  of  (3b)  or  (3)  with  respect  to  X , 


JjjR(X,y) 


2 ' 

V L 1 

XX 

- ^x^l’*  • - 

1 

V G , 
X m 

V H,  , • • 

X 1’ 

— 

V H 

X p 

T ' 

u V G 1 
1x1 

1 

1 

0 1 

1 

u V^G  ' 
m X m , 

0 

1 

1 

' 

ni 

1 

0 

V^H  ' 

X 1 1 

0 

1 

1 

1 

1 

1 

0 

T ' 

V H , 

X p 

1 

1 

1 

1 

_ 

Differentiating  (3b)  partially  with  respect  to  y yields  the 
Jacobian  with  respect  to  this  parametric  vector. 


J R(X 

y 


= [ 


u,V  G_,  H,  , . . . ,V 


yx  ’ 1 y 1 ■ ’ m y m ’ y 1 ' ' y P 

At  a Kuhn-Tucker  point,  (3b)  could  be  written  as 

R[X(y),y]  = 0 . 

Totally  differentiating  (3c)  with  respect  to  y , the  following 
relationship  is  obtained. 

JxR(X(y),y)  + JyR(X(y),y)  = 0 . 

The  principal  result  immediately  follows. 

= " ‘^j(R(X(y)  ,y)  ^ • JR(X(y),y)  . 


(4) 


(5) 


(3c) 


(6) 


(7) 
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liquation  (7)  is  almost  the  same  as  the  result  stated  under  Theorem  1 but 
is  stated  with  respect  to  the  entire  Kuhn-Tucker  triple,  X . Note  that  (7) 
is  stated  for  y in  a neighborhood  of  , whereas  the  result  of  Theorem  1 

was  only  at  y^  . This  is  possible  due  to  the  slightly  stronger  assumptions 

made  by  Fiacco  and  McCormick  [3]  and  Fiacco  [2]. 


The  above  result  is  the  basis  for  the  development  of  a methodology  for 
sensitivity  analysis  in  nonlinear  programming.  The  computational  technique  used 
here  is  the  penalty  function  method.  The  same  result  and  technique  were  used 
by  Armacost  and  Mylander  [1]  to  compute  (1),  the  principal  difference  in 
their  methodology  being  that  they  approximate  J^R  by  central  differencing 


techniques,  and  compute  all  other  derivatives  manually.  The  relevant  for- 
mulae for  the  penalty  function  implementation  using  the  logarithmic  barrier- 
quadratic  loss  penalty  function  of  the  SUMT  algorithm  [4]  are  discussed  in 
Section  3. 


3 . Sensitivity  Analysis  With  Symbolic  Input 

The  factorable  input  technique  and  computer  code  to  compute  gradient 
vectors  and  Hessian  matrices  of  problem  functions  was  developed  by  McCormick 
[5]  to  facilitate  the  use  of  nonlinear  programming  computer  codes.  The  code 
itself  has  since  been  extended  by  de  Silva  and  McCormick  to  accept  input  in 
a symbolic  format.  This  symbolic  code  has  been  used  to  solve  several  gen- 
eral nonlinear  programming  problems  and  the  ease  with  which  a problem  input 
can  be  supplied  or  modifications  done  is  very  encouraging.  A sensitivity 
analysis  option  turned  out  to  be  a natural  extension  to  the  symbolic  computer 
code . 

A reduced  version  of  the  matrix  in  (1)  which  provides  substantial 
computational  economy,  results  from  the  formulae  which  are  developed  below; 
a brief  discussion  of  the  factorable  input  philosophy  precedes  this  develop- 
ment . 


In  the  factorable  input  method,  the  problem  functions  are  built  up 
recursively  as  sums,  or  sums  of  pairwise  products,  of  simple  transformations 
of  previously  defined  functions.  initially,  these  simple  transformations 


6 


T-365 


1 

I 

i 

are  applied  to  the  original  problem  variables  , 1 = l,...,n  , and  the  j 

i 

recursive  application  of  the  above  procedure  (shown  mathematically  in  | 

Equation  (8))  generates  new  functions  , i = n+l,...,N  . ! 

1 

McCormick  [5]  gives  a detailed  exposition  illustrated  by  an  j 

example.  He  sets  , 1 = 1, ... ,n  and  refers  to  the  entire  set  | 

. i 

of  X , i = 1,...,N  as  concomitant  variable  functions  (abbrev.  CVFs) • i 

] 

The  general  equation  for  the  creation  of  a new  CVF  is: 

i-1  . i-1  p . 

X\x)  = X T^(xP(x)]  + X X [xP(x)]  • rx'^Cx)]  . (8)  • 

p=l  P p=l  q=l  ‘1-P  P’'!  < 

In  the  presence  of  a parametric  vector  y , the  equation  becomes 
i-1 

X^(x,y)  = X Tp[xP(x,y) ,y] 
p=l  P 

r . j 

+ X X „[xP(x,y),y]  • U rx'*(x,y),y]  . (9) 

p=l  q=l  ^’P  P’'^ 

The  terms  in  Che  first  summation  of  (8)  or  (9)  are  referred  to  as 
separable  terms  comprising  the  new  CVF  X^(»)  and  the  terms  of  the  j 

second  summation  of  pairwise  products  are  called  quadratic  terms.  A point  j 

of  note  in  (9)  is  that  the  arguments  of  the  simple  transformation  ^ 

T^  , or  contain  v directly  as  well  as  indirectly  via  the 

P <1.P  P»9 

argument  CVF.  The  direct  presence  of  the  parametric  vector  y is  restricted 
to,  at  most,  two  components  because  all  the  simple  transformations  used 
in  the  methodology  and  code  contain,  at  most,  two  constant  terms.  The  para- 
metric components  are  a subset  of  these  constants. 

An  examination  of  (7)  suggests  that  second  partial  derivatives  of 
the  problems  functions  with  respect  to  x and  the  cross  second  partial 
derivatives  with  respect  to  x and  y are  required.  This  is  because  R 
already  contains  first  partial  derivatives  with  respect  to  x . What  is 
developed  below  will  yield  the  matrix  in  (1)  reduced  by  postmultiplying  it 
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with  a known  column  vector  B e E or  premultiplying  it  with  a known 
row  vector  a e E*'  . The  postmultiplication  produces  an  n-vector,  each 
component  being  a weighted  sum  of  the  k partial  derivatives  of  a par- 
ticular x^(y)  , with  respect  to  the  components  of  the  parametric  vector. 


The  weights  are  the  components  of  B £ E . The  ith  component  of  this 
resulting  n-vector  can  be  looked  upon  as  the  "total  sensitivity"  of  the 
variable  x^(y)  (1=1,..., n)  with  respect  to  all  the  y^  , j = 1, k , 

weighted  according  to  B^  , j = 1 , . . . ,k  . The  rth  component  of  the 


n-vector  is 


k 

E 

j=l 


Xj.(y) 


3.  . 

1 


The  prcmultiplication  produces  a k-component 


vector,  the  jth  component  being  the  "total  sensitivity"  of  a composite  of 
all  n variables  x^(y)  , i = l,...,n  , weighted  according  to  , i = 1, 

with  respect  to  y^ (j=l, . . . ,k)  . The  ith  component  of  this  k-vector 
n 9x . 

would  be  7.  -r — a.  . 

3yn  1 


For  clarity  of  exposition,  (9)  will  be  simplified  below  to  a 
separable  portion  of  one  term  or  a quadratic  term  of  one  product. 

Consider  the  separable  case  first.  Suppose 

X^(x,y)  = T[X*’(x,y)  ,y]  . 

Letting 

T = T [X*’(x,y),y]  ^ jT[xP(x,y)  ,y]  ^ 

ax^ 

we  have 

V X^(x,y)  = V X*’(x,y)  • . 

^ ^ 9XP 


(10) 


(11) 


(12a) 


= f • xP(x,y)  . 


(12b) 


- 8 - 


T-365 


In  (12b)  T is  readily  available  since  all  transformations  are  simple 
ones,  and  V X^(x,y)  is  built  up  inductively  from  its  argument  CVF. 

2 ^ 

The  expressions  for  (x,y)  are  already  developed  and  coded  in 

[5],  and  will  not  be  repeated  here.  Note  that 

,2  „i. 


Vy^X^(x.y)  = VyV^X^x.y) 


DyD^  T[x‘^(x,y),y]  . 


(13a) 

(13b) 


In  (13b)  D denotes  total  differentiation,  although  in  the  previous 
step  the  partial  differentiation  V was  used.  This  is  due  to  the  fact 
that  the  partial  differentiation  in  (13a)  would  capture  the  total  variation  due 
to  X and  y of  the  expression  in  (12a),  but  not  so  in  (13b).  Continuing 
the  differentiation  with  respect  to  y of  (12b),  as  in  (13b),  we  get 


7y^x\x,y)  = V^xP(x,y)  • T + V^xP(x.y)  | T rxP(x,y)  ,y]  vV(x,y) 


tI  . 

y J 


2 

In  the  above,  Y = Y [x’^(x,y),y]  = .y]  ^ 


3(xP)^ 


(14) 


(15) 


In  (lA),  f and  T are  readily  available  scalar  quantities.  The  quantity 
T [X  (x,y),y]  is  the  true  partial  derivative  unlike  in  (13b).  This  is 
easy  to  compute  because  y enters  the  f argument  directly,  at  most,  in 
two  nonzero  components.  The  expression  for  V^X^(x,y)  has  been  derived 

already  in  (12a).  The  only  remaining  terms  are  7^  X^(x,y)  which  is  built 

yx 

T D 

up  inductively,  and  V^X^(x,y)  which  is  discussed  below. 
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Now 

7yX^(x,y)  = Dy  T[xP(x,y).y] 

= T[x'’(x,y),y]  + f • 7yXP(x,y)  . 


(16) 


Note  7^T(:c-^(x,y)  ,y)  Is  the  true  partial  derivative  and  will  have  at  most 

two  nonzero  components  in  which  are  easily  computable.  Thus  (16)  per- 

mits  the  computation  of  the  ^yX^(x,y)  term  of  (14)  by  induction.  Since 

the  formulae  of  (12b),  (14),  (16)  are  initially  applied  to  the  original 

problem  variables  X^  = x^  , i = 1 n , we  have,  for  p <_  n : 


7^X^(x,y) 


e • T 
P 


where 


A1  so , 


is  a unit  vector  e E”  with  unity  in  the  jth  position. 

X^(x,y)  = x’^  + e [V  v'^’x^  + T] 

yx  yx  p'  y ' 


T • 

= e V"-  T . 

p y 

Finally, 

VyX^(x,y)  = 7yT  + T 


(17) 


(18) 


(19) 


= V T . 

y 

On  page  6 , we  spoke  of  dimensionally  reducing  the  matrix  of  (1)  in 
two  separate  ways.  The  case  of  premultiplying  by  a e E”  is  considered 
below. 


Thus,  from  (17)  and  (18)  letting  Op  denote  the  pth  component  of  a , 


we  get 


a^V  X^(x,y)  = a • T . 

X ^ p 


(20) 


9 1 T • 


(21) 


The  summation  of  (21)  will  have,  at  most,  two  nonzero  components  due 
to  the  nature  of  the  simple  transformations  used.  Next,  consider  the  case 
of  p > n . Here, 
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1 


Next, 


X^(x.y)  = a'*^  7^  xP(x,y)  • T • 
7^  X^(x,y)  = 7^  X^Cx.y)  • T 
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(22) 


+ a"^  7^  xP(x,y)  |t  7^  X^Cx.y)  + 7^  t}  . (23) 

Equation  (22)  works  purely  Inductively  and  does  not  require  the 
deliberate  introduction  of  a , as  this  has  been  done  in  (20).  It  is  a 
scalar  quantity  for  each  i • The  first  term  on  the  right-hand  side  of 
(23)  is  also  an  inductive  term.  As  far  as  the  second  term  is  concerned, 

X ID 

(the  scalar  multiplier  a 7^X*^(x,y)  , the  only  presence  of  a there,)  it  is 

built  up  inductively.  The  expression  within  the  curly  bracket  is  a 
k-vector,  the  first  term  being  an  inductive  one  as  demonstrated  in  (16) 
and  the  second  possessing,  at  most,  two  nonzero  components. 

In  summary,  we  have  a scalar  quantity  for  the  reduced  version  of 
the  gradient  vector  of  each  CVF  i , and  (20)  (or  (17),  if  p ^ n)  has  to 
be  repeated  for  every  p corresponding  to  the  separable  terms  of  , 
and  added  together.  The  reduced  second  cross  partial  of  each  CVF  i 
(as  shown  in  (21)  or  (18)  depending  on  whether  p > n)  is  a k-vector  for 
each  component  p , which  must  be  added  together  as  before. 

The  quadratic  components  of  (9)  will  be  addressed  next.  Considering 
the  creation  of  an  X^  , 1 .=  n+l,...,N  , using  a single  product,  we  have 

X^(x,y)  = V[xP(x,y)  ,yl  * u[x‘’(x,y)  ,yl  , (24) 

then, 

7^X^(x,y)  = D^V[xP(x,y),y]  * U[x‘>(x,y)  ,y] 

+ V[xP(x,y),y]  * D^U[x‘’(x,y),y]  (25) 

= VU  • 7 xP(x,y)  + VU  • 7 x‘’(x,y)  . (26) 

X X 


11 


T-365 


I 

I 

I 

i 


• • • 

Now  V , U are  defined  analogously  to  T (see  (11)).  Since 

* • n 

V , V , U , U are  scalar  quantities,  the  premultiplication  by  a e E 

could  be  done  as  in  (20)  and  (22)  depending  on  whether  p and  q 
are  ^ or  > n . The  separate  terms  in  (25)  must  be  added  together. 
The  result  is 

a^V^X^(x,y)  = W a^V^X*’(x,y)  + VU  a^V^x‘’(x,y)  . 

When  p (or  q)  is  > n , cx^V  X^  (or  a'^7  x'^)  enters  purely 

X X 

• • 

inductively.  Otherwise,  the  term  simplifies  to  VU  (or  VU  “q) • 

The  second  cross  partial  derivative  is  obtained  by  differentiating  (26) 

with  respect  to  y . The  arguments  of  and  are  omitted  for 

convenience  of  representation.  Finally, 

V^  X^(x,y)  = uiv*V^  xP  + V xP[V*V^X*’  + W]\ 

yx  ' I yx  X ^ y y J 

+ V • V xP  • D U 
X y 

+ V |u-v^  x'^  + V x''t’u*v^x‘’  + U]i 
I yx  X " y y ) 

+ U • V x'*  • D V . 

X y 

Premultiplying  (27)  by  a e , and  rearranging, 

X^(x,y)  = U | a V X^  • V + a^V  X^fv  • V^X^  + V]l 
yx  -^  (yx  X y y ; 

+ V ia^V^  x'’  • U + a^V  x'^fu  • V^  U]i 
( yx  X ' y 

T P • T a • 

+ a V X • D U • V + a V X^  • V D • U . 

X y X y 


(26)  ! 

I 


(27) 


(28) 


The  evaluation  of  the  portions  within  the  curly  brackets  in  (28)  is 
identical  to  the  evaluation  of  the  cross  second  partials  in  the  separable 
analysis,  i.e.,  the  right-hand  side  of  (23).  The  last  two  terms  of  (26) 

contain  two  quantities  a^V  X^  and  a^V  X*^  which  are  developed  by 

X X 

induction  as  demonstrated  by  (12)  and  (20).  The  quantities  D^U  and  D^V 
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are  analogous  to  D^T  whose  computation  is  shown  in  (16) . The  only 
remaining  expression  to  be  developed  is  that  for  « 

V^(x,y)  = D {V[xP(x,y),y]  U[x‘^(x.y)  .y] } 
y y (25 

= U{V*7  xP+7  V}  + V{U  7 xVv  U}  . 

y y y y 

Examination  of  (23)  and  (28)  shows  that  it  is  not  necessary  to 
premultiply  the  expressions  of  (16)  and  (29)  by  , since  V^X  never 

gets  directly  premultiplied  by  it. 

The  above  Equations  (20)  through  (29)  have  developed  the  necessary 
components  to  perform  the  computations  necessary  to  produce  a row  k-vector 
T 

a [dx(y)/dy|  . The  next  section  describes  how  these  are  implemented  via  a 
penalty  function  method. 

4 . Computation  of  Sensitivity  Results 

The  SUMT  computer  code  [ 4 ] is  used  in  conjunction  with  the  factorable 

T 

symbolic  sensitivity  analysis  code  to  yield  [a  dx(y)/dy].  The  code  uses  an 
interior-exterior  penalty  function  with  a single  parameter  r , and  generates 
a sequence  of  subproblems  which  will  be  denoted  as  M(y,r)  , which  tend  to 
M(y)  as  r ->■  0 . In  straight  nonlinear  optimization  SUMT  uses  an  extra- 
polation based  on  r to  estimate  the  relevant  values  at  r = 0 . A similar 
extrapolation  technique  is  used  here  to  provide  first  order  estimates  of 

[a^dx(y)/dy|  . 

A logarithmic  barrier  quadratic  loss  penalty  function  is  considered 
below.  For  M(y) , this  penalty  function  is 

2 

m p H (x,y) 

P[x,y,r]  = F(x,y)  - r I in  G (x,y)  + I -J— . (3C 

i=l  ^ j=l 


The  problem  solved  at  the  ith  stage  of  the  sequential  procedure,  for 


a value  r 


0 

r*-  IS 


MIN  P[x,y,r  ] 

„0 

xeR 


Problem  M(r,y) 


where 


R = {x|G^(x,y)  > 0 , i=l,...,m} 
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The  optimality  condition  is  that  the  gradient  with  respect  to  x 
vanishes,  i.e., 

= 0 . 

At  local  optima,  for  a general  value  of  r we  can  write 
V^P[x(y,r),y,r]  H 0 . 

Applying  this  to  (30), 


V/(x<y.r),y)  - 


V n [x(y,r),yl 


P 2 H tx(y,r),y] 

+ j;  V^Hjx(y,r),y] 

1=1  ^ 


x“j  ' 


H 0 


Differentiating  (33)  partially  with  respect  to  y , we  get 


7"  P[x(y,r),y,r] 
yx 

= V F[x(y,r),y]  - Z 7^  G [x(y,r),y]  • — ■/ — r — r 
yx  yx  i'  G^[x(y,r),y) 


m 

+ Z 


1=1  G^[x(y,r),y] 


2 ' ‘FI  * ^yGj^[x(y,r),y] 


+ Z 7"  H [x(y,r),y] 
j=l  ^ ^ 


2 H^[x(y,r),y] 


Z (-)  7 H 


j[x(y,r),y]  • 7^Hj [x(y,r) ,y]  . 


Differentiating  (33)  totally  with  respect  to  y , 

V^/[x(y,r),y.r]  ^ ,r)  ,r]  = 0 


(31) 


(32) 


(33) 


(3A) 


- lA  - 


(35) 
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The  argument  variables  within  the  square  brackets  will  be  omitted  when 

referring  to  the  terms  in  the  above. 

2 2 
has  been  developed  elsewhere  by  McCormick  [5].  Those  for 

are  developed  using  the  Kquation  (34).  Thus, 

- 7^/[x(y,r),y,r]“^  • V^P[x(y,r)  ,y,r]  . (36 

The  similarity  between  (7)  and  (36)  is  to  be  noted.  Suppose  (36)  is 
premuJ  tiplied  by  some  a e E*'  . Then,  we  get 


J dx(x.r).  = _ „T  . vJ^P[x(y,r),y,r]  , (37) 

where 

= 0^  • V^^P[x(y,r) ,y,r]  ^ . (38) 

If  o is  a known  vector,  a is  determined  because  the  Hessian  of  the 
penalty  function  with  respect  to  x is  known  by  [5].  Then  (37)  is  determined 
by  premultiplying  (34)  by  this  a , and  using  (20),  (21),  (22),  (23),  (27)  and 
(29)  to  determine  the  individual  components  of  the  resulting  equation.  This 
can  be  done  because  every  problem  function  F , ; i = l,...,m  , 

IL  ; i = l,...p  , can  be  identified  with  a CVF  , i = l,...,n  , 

n+1, . . . ,N  . 

The  final  aspect  remaining  to  be  discussed  in  the  context  of  the 

T <ix(y  r) 

sensitivity  methodology  is  the  improvement  of  the  estimate  of  a — by 

extrapolating  on  r , at  the  )lth  sequential  unconstrained  problem,  to 
r = 0 . A Taylor  series  approximation  applied  at  the  ith  and  (i-Dth 
siibproblems  yields 
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dxCy.r*^)  • dx(y,0)  + ^ rdx(y,0)1 

dy  dy  dr  dy  J 

dx(y,r^~^)  • dx(y,0)  ^ ^ f dx(y,0) 

dy  ~ dy  dr  dy 


(39) 

(40) 


Eliminating  the  last  term  in  (39)  and  (40) 

■d(x(y,0))/  )t-l  _ l\  • 1-1  ^ dx(y.r^)  _ t.  ^ dx(y,r^~^) 

dy  \ / dy  dy 


(41) 


£_1  0 

from  which,  (letting  r = c • r ) we  get 


dx(y ,0) 
dy 


dx(y^r*')  dx(yj^ 


dy 


"dy  . ' 


(42) 


Hence  by  performing  the  computation  for  successive  subproblems  when 
r becomes  sufficiently  small,  (42)  can  be  used  to  estimate  the  value  for 
the  original  problem,  i.e.,  r = 0 . 
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To  cope  with  the  expcmclini>  technology,  oiir  society  must 
be  assurer!  ot  a continuing  su(:)ply  ol  rigorously  trained 
and  educated  engineers.  The  School  ol  Engineering  and 
Applied  Science  is  completely  committed  to  this  ob- 
jective. 
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